Construction of Disease-Symptom Knowledge Graph from Web-Board Documents

نویسندگان

چکیده

The research aim is to construct a disease-symptom knowledge graph (DSKG) as cause-effect containing relations relation type determined from downloaded documents on medical web-board resources. Each connects disease-name concept node (a causative-concept node) corresponding having group of correlated symptom-concept/effect-concept features common among some concepts. DSKG benefits non-professionals in preliminary diagnosis through recommender web-board. There are three main problems: how determine symptom concepts sentences without annotation the documents’ topic-names; with/without complications; and involving high dimensional symptom-concept after union groups. Therefore, we apply word co-occurrence pattern including medical-symptom expressions Wikipedia MeSH Lexitron Dictionary Cartesian product applied for automatic-supervised machine learning relation. We propose using Principal Component Analysis constructing by dimensionality reduction with minimized information loss. In contrast previous works, proposed approach enables construction precise concise representation scores 7.8 9, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Extraction of Knowledge from Web Documents

A large amount of digital information available is written as text documents in the form of web pages, reports, papers, emails, etc. Extracting the knowledge of interest from such documents from multiple sources in a timely fashion is therefore crucial. This paper provides an update on the Artequakt system which uses natural language tools to automatically extract knowledge about artists from m...

متن کامل

Controlled Knowledge Base Enrichment from Web Documents

The Linked Open Data initiative brought more and more RDF data sources to be published on the Web. However, these data sources contain relatively little information compared to the documents available on the surface Web. Many annotation tools have been proposed in the last decade for the automatic construction and enrichment of knowledge bases. But, while noticeable advances are achieved for th...

متن کامل

Cultural Heritage: Knowledge Extraction from Web Documents

This article presents the use of NLP techniques (text mining, text analysis) to develop specific tools that allow to create linguistic resources related to the cultural heritage domain. The aim of our approach is to create tools for the building of an online “knowledge network”, automatically extracted from text materials concerning this domain. A particular methodology was experimented by divi...

متن کامل

Keyword Suggestion Using Concept Graph Construction from Wikipedia Rich Documents

Concept graph is a graph in which nodes are concepts and the edges indicate the relationship between the concepts. Creation of concept graphs is a hot topic in the area of knowledge discovery. Natural Language Processing (NLP) based concept graph creation is one of the efficient but costly methods in the field of information extraction. Compared to NLP based methods, Statistical methods have tw...

متن کامل

Embedding Knowledge in Web Documents

The paper argues for the use of general and intuitive knowledge representation languages (and simpler notational variants, e.g. subsets of natural languages) for indexing the content of Web documents and representing knowledge within them. We believe that these languages have advantages over metadata languages based on the Extensible Mark-up Language (XML). Indeed, the retrieval of precise info...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12136615